Hybrid, Interpretable Machine Learning for Thermodynamic Property Estimation using Grammar2vec for Molecular Representation

نویسندگان

چکیده

Property prediction models have been developed for several decades with varying degrees of performance and complexity, from the group contribution-based methods to molecular simulations-based methods. An interesting issue in this area is finding an appropriate representation molecules inherently suited property modeling problem. Here, we propose Grammar2vec, a SMILES grammar-based framework generating dense, numeric representations. Grammar2vec embeds structural information contained grammar rules underlying string representations molecules. We use build machine learning-based estimating normal boiling point (Tb) critical temperature (Tc) benchmark their against popularly used contribution (GC)-based To ensure interpretability ML model, perform Shapley values-based analysis estimate feature importance simplify (or prune) trained model.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Hybrid Machine Learning Method for Intrusion Detection

Data security is an important area of concern for every computer system owner. An intrusion detection system is a device or software application that monitors a network or systems for malicious activity or policy violations. Already various techniques of artificial intelligence have been used for intrusion detection. The main challenge in this area is the running speed of the available implemen...

متن کامل

channel estimation for mimo-ofdm systems

تخمین دقیق مشخصات کانال در سیستم های مخابراتی یک امر مهم محسوب می گردد. این امر به ویژه در کانال های بیسیم با ‏خاصیت فرکانس گزینی و زمان گزینی شدید، چالش بزرگی است. مقالات متعدد پر از روش های مبتکرانه ای برای طراحی و آنالیز ‏الگوریتم های تخمین کانال است که بیشتر آنها از روش های خاصی استفاده می کنند که یا دارای عملکرد خوب با پیچیدگی ‏محاسباتی بالا هستند و یا با عملکرد نه چندان خوب پیچیدگی پایینی...

Machine Learning Models for Housing Prices Forecasting using Registration Data

This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient B...

متن کامل

Making machine learning models interpretable

Data of different levels of complexity and of ever growing diversity of characteristics are the raw materials that machine learning practitioners try to model using their wide palette of methods and tools. The obtained models are meant to be a synthetic representation of the available, observed data that captures some of their intrinsic regularities or patterns. Therefore, the use of machine le...

متن کامل

Interpretable Machine Learning for Privacy-Preserving IoT and Pervasive Systems

The presence of pervasive computing in our everyday lives and emergence of the Internet of Things, such as the interaction of users with connected devices like smartphones or home appliances generate increasing amounts of traces that reflect users’ behavior. A plethora of machine learning techniques enable service providers to process these traces to extract latent information about the users. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Fluid Phase Equilibria

سال: 2022

ISSN: ['0378-3812', '1879-0224']

DOI: https://doi.org/10.1016/j.fluid.2022.113531